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between the Grothendieck's inequality and AQBC. 
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I. INTRODUCTION 



Suppose we are given a quantum system with a d- 
dimensional Hilbert space rid, and the state (or density 
operator) of the system is given by a trace one posi- 
tive operator p from the linear operator space L('H ( j). 
Quantum channels, or trace-preserving completely pos- 
itive maps, are all possible deterministic quantum op- 
erations one can perform over the system 0, Let 
4> be such a quantum channel over L("H,i) with Kraus 

operator sum representation $ = J2k^ k ' ^k' anc ^ ^ 
K(&) — span{£fc} be its Kraus operator space. The con- 
vex hull of unitary channels (noiseless channels) on L(Hd) 
is given by Conv(U(% d )). So any * € Conv(U(H d )) can 
be written as a mixture (convex combination) of uni- 
tary channels. (The number of unitary channels in the 
mixture can be made finite due to the Caratheodory's 
theorem on convex hull). The mixture of unitary chan- 
nels plays a special role in environment-assisted quan- 
tum communication model. Actually, these channels can 
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be made noiseless for quantum information transmission 
with the help of a friendly environment even in one-shot 
case. Furthermore, it turns out that these channels are 
the only quantum channels having this desirable prop- 
erty Q . Surprisingly, if arbitrarily large number of uses 
of the channels are allowed, unital quantum channels, 
those channels $ with identity operator a fixed point, 
say <&(/) = /, can also achieve maximum capacity and 
act exactly like noiseless channel 4]. 

Clearly, any mixture of unitary channels remains uni- 
tal. An interesting question is to ask whether one can 
reverse this procedure, i.e., decomposing any unital quan- 
tum channel $ G T("H(j) into a mixture of unitary chan- 
nels from U(Hd)- This was called "quantum Birkhoff 
conjecture" (QBC), originated from Birkhoff 's celebrated 
characterization of the extreme points of doubly stochas- 
tic matrices. Unfortunately, this conjecture is only true 
for d < 2, and counterexamples exist whenever d > 3 
0-0] • This suggests the following quantity to measure 
the distance between $ and the convex hull of unitary 
channels. 

D($,Conv(UCH))) = inf{L>($,*) : * e Conv(U(W))}, 

where _D($,^) will be given by the diamond norm of 
<f> — '5. Since ConvCH^)) is a compact convex set, "inf" 
in the above equation can be replaced by "min" . 
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Motivated by some results in about the environment- 
assisted quantum capacity and in an attempt to remedy 
the conjecture in certain way, Smolin, Verstraete, and 
Winter proposed the following 

Conjecure 1. (Asymptotic Quantum Birkhoff Conjec- 
ture $]) Let $ e T{H) be a unital channel, then $® n 
can be approximated by a mixture of unitary channels 
from U('H l8m ) with arbitrary precision. That is 

Iim n ^oo£($® n ,Conv(U(ft®"))) = 0. 

This revised conjecture seems highly reasonable as one 
could naturally expect that many copies of a unital chan- 
nel will be better approximated by a mixture of unitary 
channels on a higher-dimensional space. If this is true, 
it will provide a very satisfactory interpretation to the 
following result: The environment-assisted quantum ca- 
pacity of any unital channel over Jj(7id) is given by log 2 d 
qubits, the maximum capacity one can achieve under this 
model. A much more deep consequence is that the struc- 
ture of unital channels will be greatly simplified. Due to 
its significance, the asymptotic quantum Birkhoff conjec- 
ture was listed as one of major open problems in quantum 
information theory Q. 

Some supporting evidences were obtained in Ref. [l(| , 
where Mendl and Wolf presented a unital channel $ such 
that $® 2 is a mixture of unitary channels although $ 
itself is not. Furthermore, they showed that it is pos- 
sible that the tensor of $ and a constant unital channel 
(a completely depolarizing channel that maps every state 
into the completely mixed state I /d) may become a mix- 
ture of unitary channels. One may naturally conjecture 
these properties might be true for any unital quantum 
channels. 

Recently Haagerup and Musat disproved this asymp- 
totic version by exhibiting a class of so-called non- 
factorizable maps as counterexamples [ll[. Actually the 
results obtained in Ref. [llj shows that any such non- 
factorizable map $ is a very strong counterexample to 
AQBC in the following sense: 

D(<S> ® * , FM(L(H d ® Um)) > FM{L{U d ))), 

where ^ is any unital channel over L('H m ), and 
FMilji'Hd)) denotes the set of factorizable maps over 
L('H £ i)- In other words, any non-factorizable map tensor- 
ing with a unital channel could not reduce the distance 
to the set of factorizable maps, which is a super-set of the 
convex hull of unitary channels. See also Shor's talk in 
Ref. [12[ for an alternative approach to AQBC and an ex- 
cellent discussion of the results in Ref. [ll| . The interest- 
ing thing here is that all these counterexamples are non- 
factorizable maps, and it remained unknown whether any 
facterizable map would fulfill AQBC. This problem was 
signified in the arXiv version of Ref. [ll[ by establish- 
ing the following surprising connection: If all factorizable 
maps satisfy AQBC, then the Connes embedding prob- 
lem has a positive answer. 



Motivated by these progresses and in order to better 
understand the structure of unital channels, in this pa- 
per we are interested in estimating the trace distance be- 
tween a unital quantum channel and the convex hull of 
unitary channels, say £>($, Conv(U(%))). We find that 
this distance is interesting even from the perspective of 
quantum channel discrimination: Suppose we are given 
an unknown quantum channel, which is secretly chosen 
between $ and some ^ € Conv(U('H))) with equal proba- 
bility 1/2. Then due to the operational meaning of trace 
distance, we can conclude that the success probability 
of discrimination is at least 1/2 + l/4£>($, Conv(U(H))), 
which is strictly larger than 1/2 whenever $ is not a mix- 
ture of unitary channels. Another purpose of this paper is 
to provide some relatively elementary and self-contained 
disproofs to AQBC. This is partially due to the fact that 
the elegant disproof of AQBC in Ref. makes use of 
some basic properties of factorizable maps which cannot 
be easily appreciated by readers who do not have deep 
background in operator algebras. 

In Section II we collect some preliminaries about super- 
operators and Schur channels. Then in Section III we ex- 
plain in detail the operational meaning of trace distance. 
In Section IV we first provide a computable lower bound 
for -D($, Conv(U('H))) when the Kraus operator space of 
$ does not contain any unitary operator. This enables 
us to derive many counterexamples for AQBC, includ- 
ing some factorizable maps presented in Ref. [llj]. It 
is worth pointing out that this proof only employs some 
basic techniques from quantum information theory. We 
believe that it may interest readers with quantum infor- 
mation background. In Section V we go further to study 
the class of Schur channels. In this special case, we are 
able to provide a lower bound and an upper bound for 
D(&, Conv(U(T-L))). Roughly speaking, we show that up 
to a factor of 1/2, any Schur channel can be approxi- 
mated by a mixture of diagonal unitary channels, and 
the later has a simpler structure. As a direct applica- 
tion, we obtain a new proof of the fact that any Schur 
channel that does not satisfy the QBC will automatically 
violate the AQBC. Our proof for this part has employed 
some powerful tools from operator algebras. In Section 
VI we present two explicit examples of Schur channels to 
demonstrate the utility of our results: the first example 
has only two Kraus operators and is a non-factorizable 
map, and the second one is a factorizable map. As an- 
other interesting application, in Section VII we point out 
a connection between AQBC and Grothendieck's inequal- 
ity in the metric theory of tensor products. 

Remarks on related results: After we obtained the 
results in Section IV, and were working on the proof of 
the Theorem[3]in Section V, the second author R.D. hap- 
pened to learn from Prof. M. B. Ruskai that Haagerup 
and Musat had made further progress on the connec- 
tion between Schur channels and AQBC. Namely, they 
obtained Theorem [5] and thus showed that any Schur 
channel that violates QBC (including some factorizable 
maps) should also be a counterexample to AQBC [l3j . 
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They also provided a modified version of the connection 
between factorizable maps satisfying AQBC and Connes 
embedding problem. The proof of Theorem [3] has em- 
ployed some similar techniques in (l3| . 



II. PRELIMINARIES 

We will use symbols H, K,, etc to represent finite di- 
mensional Hilbert spaces over complex numbers. A d- 
dimensional Hilbert space %, which is essentially the 
same as C d , will be explicitly represented as TLd when- 
ever the dimension matters. L('H,/C) denotes the set of 
linear operators (or mappings) from % to fC, and L(K) is 
shorthand for L(U,H). For any X G L(H), Z f G L{U) 
denotes the adjoint operator (or complex conjugate) of 
X. X is Hermitian (or self-adjoint) if = X. U G h(H) 
is said to be unitary if W U = I-h ■ We denote the set of 
unitary operators on H by U(H). X e L('H) is (semi- 
definite) positive, write X > 0, if the quadratic form 
(ip\X\ip) > for any \if>) E H. In particular, X is 
said to be a density operator (or a quantum state) if 
X is positive and with trace one. T{H, K.) is the set 
of linear mappings from L(7i) to L(/C). Again, T(%) 
is shorthand for T(W,W). Elements in T(H,1C) are nor- 
mally called super-operators. Note that L("H) is a Hilbert 
space with the standard Hilbert-Schmidt inner product 
< A,B >= Ti (A^B). Then the adjoint operator of 
$ G T(H,JC) is defined as the unique super-operator 
$t g T(K,n) such that 

< y.spo >=< $ f (r),x >, va g L(«),ye l(/c). 

A super-operator $ e T(H,K.) is said to be positive if 
it preserves the positivity, say, $(A) > whenever X > 
0. $ is said to be a quantum channel if it satisfies: i) 
(trace-preserving) Tr($(X)) = Tr(A) for any X £ L(H), 
and ii) (completely positive) for any n > 1, the induced 
super-operator $„ = $ (g) 4(W„) e T C^ ® W„,/C ® H n ) 
is positive, where /l(h,i) 1S the identity super-operator 
on L(H n ). We call $ a quantum unital channel if it 
further satisfies: iii) (unital condition) $(/-«) = I/c- Any 
unitary operator U G U("H) induces a unitary quantum 
channel U G T("H) in the following way: W(X) = f/Xt/t. 
The class of unitary channels on L(H) will be denoted as 
U{H). 

Any super-operator $ G T(H,/C) can be represented 
by a pair of linear operators A, B G h(H,lC €5 2) such 
that 



(1) 



where Z is an auxiliary Hilbert space with dim(Z) < 
dim('H)dim(/C), and Trg represents the partial trace over 
Z. For the special case of quantum channels, the above 
form can be greatly simplified. Actually, in Eq. (TT]) we 
can choose A = B = V G L(H,fC <g> Z) for some isom- 
etry V and obtain the following well-known Stinespring 
unitary embedding representation of a quantum channel: 



$(A) = Ti z VXV\ V ] V = L, 



If we specify an orthonormal basis {|fcz)} of we can 
rewrite $ in Eq (fT]) into the following form: 



dim(2) 

<s>po= J2 A * XB l 
fc=i 



(3) 



where Ak = (kz\A and B^ = (kz\B are linear operators 
in L("H,/C). Similarly, when $ is a quantum channel, we 
can choose At = B^ = (kz\V so that 



dim(Z) 



$(x)= ^ A fc ^4, 24^ = i w , (4) 



fc=i 



which is the famous Kraus operator sum representation 
of a quantum channel 

Now we tend to introduce norms of super-operators in 
T(H,JC) based on the norms of linear operators. We refer 
to Refs. 15] for some detailed discussion on norms 
of super-operators and how to compute them using semi- 
definite programming techniques. We will briefly review 
some basic results for later use. For any X G L(Hd) and 
p > 1, the p-th norm of X is given by 



\X\ 



(Tr|X| 



where \X\ — v)OX. The trace and the operator norms 
of X are special cases of p = 1 and p — > oo, respectively, 



Mil =1*1*1 



I-XIIqo = max II X 



The trace norm and the operator norm of a super- 
operator $ G T(%,/C) are given respectively as follows: 



i$n 1= sup \mx)\\ 1: h*ii c 
iwii<i 



sup IMX)^. 

\\x\U<i 



In the above equation we can replace "sup" with "max" 
when only finite dimensional Hilbert spaces are involved. 
The completely bounded trace norm (or diamond norm) 
and operator norm (simply completely bounded norm) 
are given respectively as follows: 



1*1 



SUp||$®/ L(Kn) ||i, ||$|| cb = SUp||$®/ L 



(H„)l|oo- 



Proposition 1. For any <I> G T(W,K,), the diamond 
norm and the completely bounded norm satisfy the fol- 
lowing properties: 

• i) The dimension of the auxiliary system to achieve 
the norms can be restricted to that o/'H,||$|L = 



ll$®-?L(«)||i and ||$|| cb 



1$, 



'L(K)l|oo- 



H ■ 



(2) 



• ii) The following duality relation holds for <& and 

\m\i = ll $t IU and = ll^llcb. 

• iii) If $ is completely positive, then ||$||<> = ||3>||i 
and ||$|| cb = H^Hoo = W^il^W^. 



4 



The norms denned above enable us to introduce dis- 
tance between quantum states and quantum channels. 
The trace distance between two quantum density opera- 
tors p and a in L(%) is given by 

D(p,a) = \\p-a\\ 1 . 

In the following discussion we also need the fidelity be- 
tween p and a, 

F(p, a) = Try/ p^ap 1 / 2 . 

The so-called Uhlmann theorem makes the meaning of 
fidelity more transparent: 

F(p,*)= max KV#>|, 

where \tp),\<j)) &%®K range over all purifications of p 
and a, respectively, say Trjc|^>)(^| = p and Ttk;|</>)(</>| = 
a. Most notably, the above equation remains true even 
when one of \ip) or \<f>) is fixed. This fact plays a crucial 
role in our later discussion. Trace distance and fidelity 
are equivalent in characterizing the distance between two 
states in the following sense: 

2(1 - F(p, a)) < D(p, a) < 2y/l ~ F 2 (p,a). 

Following the same idea, we can define the trace dis- 
tance between two quantum channels $ and \& via the 
following way: 

£>($,¥) - ||*-*||o. 

Let us now introduce a special class of super-operators. 
For any S E L{Hd), we can define a super-operator $s 
via the following way: 

<S>s(X) = SoX, VIeL(%), 

where S o X — [s k jX k j] is the entry- wise product or 
Hadamard product. (Here we assume that we have spec- 
ified an orthonormal basis {\k} : k — 1, • • • , d} for Hd- 
Thus any linear operator from L('Hd) is expressed as a 
matrix under the standard matrix basis For 
instance, S = Ylk,j s kj\k){j\)- Such $5 is called Schur 
multiplier induced by S. Schur multipliers have been 
extensively studied in the literatures of operator alge- 
bras. We refer to Chapters 3 and 8 of Re f. M for some 
highly accessible introductions, and Ref. [11J for recent 
advances. For later use, some basic properties of Schur 
multipliers are listed as follows: 

Proposition 2. Let S E L(Hd)- Then <&s satisfies the 
following: 

• i) $s = X^fc=i ^fc ' ^t> w here all A k , B k are diago- 
nal matrices; 

• ii) $5 is positivity-preserving iff S is positive; 

• Hi) $5 is completely positive iff S is positive; 



• iv) $5 is trace- preserving if s kk — 1 for k = 

• v) $5 is unital iff Skk — 1 for k = 1, • • • , d. 

Proof: iv) and v) follow directly by evaluating 
Tr($s(|fc)(j|)) = SkjSkj- We shall see that ii) and Hi) 
are simple corollaries of i). So we first prove i). In fact, 
let A and B be any two dxd matrices such that S = AB* . 
We may assume A = [a%, ■ ■ ■ , aj] and B = [&i, • • • , bd], 
where a& and bk are all <i-dimensional column vectors. 
Then S = Y, k a k b\. Set A k = Diag(a fe ), B k = Diag(& fe ). 
That is, A k and B k are diagonal matrices with diago- 
nals a k and b k , respectively. By some routine calcula- 
tions we directly verify that 3>s(X) = J2 k =i A k XB\ for 
any X G L('Hrf). In particular, when S is positive we can 
write S = AA^ for some A E L(Wd)- Hence we can choose 
A k = B k in this special case. That proves both the posi- 
tivity and completely positivity of <I>s- Conversely, if $s 
is positive. Then by choosing |e) = Y] k _-, \k), we have 
$s(|e)(e|) = S is positive. □ 

The following proposition gives another fundamental 
property of Schur multiplier. Relevant discussions can 
be found in Page 110 of Ref. [|. 

Proposition 3. For any Schur multiplier $, the dia- 
mond norm, the trace norm, completely bounded norm 
and operator norm all coincide, that is, ||$||« = \ \&\\i = 

||$||cb = PII°o. 

So a Schur multiplier $5 is a quantum channel iff S is 
positive and with all diagonal entries one. In particular, 
whenever <I>5 is a quantum channel, it is also unital. We 
shall denote 

S(H d ) = {^s-Se L(H d ), S > 0, s kk = 1, 1 < k < d}, 

and call the elements from S(Hd) (or simply Sd) Schur 
channels. Note that the difference of two Schur multi- 
pliers is still a Schur multiplier. Applying Proposition [31 
we obtain an immediate consequence that auxiliary sys- 
tems are not required to distinguish between two Schur 
channels. 

III. OPERATIONAL INTERPRETATION OF 
TRACE DISTANCE 

We have introduced trace distance between quantum 
states and quantum channels, and will study the trace 
distance between a unital quantum channel and the con- 
vex hull of unitary channels in greater detail. Before 
we proceed, we need justify the importance of this mea- 
sure from the perspective of quantum information. In 
one word, the trace distance characterizes some sort of 
stochastic distinguishability of quantum states and quan- 
tum channels. Actually, the trace distance naturally oc- 
curs when we study the following state discrimination 
problem. Suppose we are given an unknown quantum 
system whose state is secretly prepared in one of po and 
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pi, with equal priori probability 1/2. The task here is 
to determine the identity of the system with a success 
probability as high as possible. To do so we need ap- 
ply a two-outcome quantum measurement {Eq,E\} to 
the system, and to maximize the success probability of 
discrimination, i.e., 

P S ucc{po,Pi) = max i(Trp -Eo + Trpi^i), 

{Eq,Ei} Z 

where Ei > and Eq + Ei = I. By some simple algebraic 
manipulations, one can verify that the optimal success 
probability of discrimination is given by [16( 

Psucc{po,Pi) = ^ + -D(po,pi). 

Thus a larger trace distance between po and pi implies 
a higher success probability of discrimination. This in- 
terpretation can be extended to compact convex sets of 
density operators. Let Ao and Ai be two compact convex 
sets of density operators. The trace distance between A 
and Ai is given by 

D(Ao,Ai) = min{D(p ,Pi) : pi e A+,i = 0,1}. 

Then the optimal discrimination probability between Ao 
and Ai is given as 

P succ (A 0) A 1 ) = i + ^(A ,A 1 ). (5) 

The above formula indicates that we can operationally 
distinguish between two compact convex sets of density 
operators by performing a universal quantum measure- 
ment, and the success probability of discrimination is 
completely characterized by the trace distance between 
Ao and Ai. The most surprising thing here is that the 
quantum measurement we perform does not depend on 
the exact form of the unknown state except the assump- 
tion that it is from one of Ao and Ai. 

It seems that Eq. J5J was first obtained by Gutoski 
and Watrous in Ref. [18| by using the convex set separa- 
tion theorem. Jain provided a different way based on the 
minimax theorem [19j. For completeness, we will out- 
line the later approach as follows. Let {Eq,Ei} be the 
quantum measurement we need perform, and po and p\ 
be two states from Ao and Ai, respectively. Then the 
optimal success probability is given by 

P S ucc(A , Ai) = max min ^-(Tr p a E a + Trpi-Ei). 

{-Ei} Pi6Ai Z 

The crucial point here is that we first take "min" over 
all possible pair of states po and p\ according to a fixed 
measurement {Eq, Ei}, and then take "max" over all pos- 
sible measurements to maximize the success probability 
of discrimination. Noticing that the objective function is 
linear in (E , Ei) and (po, pi) when one of them is fixed, 
and all involving sets are compact convex, we can apply 



appropriate form of Sion's minimax theorem |17| to ex- 
change the order of "max" and "min", and obtain Eq. 
([5]) immediately. 

Now we try to generalize the above result to the case of 
quantum channels. The simplest case is to distinguish be- 
tween two quantum channels <I>o, $i G T(H,IC). The ba- 
sic strategy here is to choose an input state p € L(H'®H), 
and then to distinguish between the respective output 
states /l(-H') ® ^i(p): where H' is a finite-dimensional 
auxiliary state space. We have 

D($o, = D((I HW) ® $ )(p), {Ihw) ® *i)(p))- 

To achieve the maximum success probability, we need 
take "sup" over all possible input states, and have 

D($0i$i) = supD($ ,$i;p)- 

p 

One can readily verify that the RHS of the above equa- 
tion gives us the diamond norm ||$ — 3>i||o, an d p can 
be restricted to density operators onH^V. (thus "sup" 
can be replaced as "max"). To generalize the trace dis- 
tance to compact convex sets of quantum channels, we 
first need the trace distance with input state p as follows: 

5(C ,Ci;p) - min £>($„, 

Then the final resulting operational trace distance be- 
tween Co and Ci is given by 

5(C ,Ci) = sup5(C ,Ci;p) = sup min L>($ , ®V, p), 

p p *i£C, 

(6) 

where p ranges over all possible bipartite density opera- 
tors on W <&H, and it is not clear whether we can re- 
place "sup" with "max" as the dimension of H' may be 
arbitrarily large. The optimal success probability of dis- 
crimination between Co and Ci is given by 

ftuccCCo.CO = i + l5(G ,Ci). 

Interestingly, the (ordinary) trace distance between Co 
and Ci is given by 

D(Co,Ci)= min D($ ,$i) = min max5($o, p)- 

The major difference between D(Co,Ci) and D(Cq,Ci) 
is that the orders of "max" ("sup") and "min" has been 
reversed. It is not obvious that whether the orders of 
"min" and "max" ("sup") are exchangeable or not as 
it is unclear whether the objective function D($o, $i; p) 
satisfies the requirements of minimax theorem. Conse- 
quently, it seems not clear whether D(Co, Ci) is the same 
as D(Co,Ci). Nevertheless, we still have 

5(C ,Ci;p) < 5(C ,Ci) < D(C ,Ci). 
In particular, we have the following simple property. 
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Property 1. Let Co,Ci C T(H,fC) be two compact con- 
vex sets of quantum channels, and let p be a bipartite 
pure entangled state over "H^Tl with full Schmidt rank. 
Then the following are equivalent: 

i) . C n Ci = 0; 

ii) . D(C ,Ci) >0; 
Hi). D(C , Ci) > 0; and 
iv). 5(C ,Ci;p) > 0. 

Proof: Wc only need to establish the equivalence 
between i) and iv). By definition, iv) means that we 
can distinguish between Co and Ci using p as an in- 
put. This immediately implies that Co and C\ should 
be disjoint. In other words, i) should hold. The direc- 
tion that i)=> iv) is a little bit tricky, and the key here 
is to apply a generalized form of Choi isomorphism Q 
between super-operators and bipartite linear operators. 
By contradiction, assume that Co and Ci are disjoint 
but -D(Co, Ci; p) = 0. It follows from the definition that 
there exist $o S Co and $i 6 Ci such that 

D((I Hn) ® * )(p), (I HH) ® $i)(p)) = 0. 

Equivalently, we have 

(I Hn) ® * )(p) = (4(H) ® (7) 

Noticing that p is a bipartite pure state with full 
Schmidt rank, we have the following generalized Choi- 
isomorphism: 

J : $^ (7 L(W) ® *)(p). 

(The standard Choi-isomorphism is to choose p as the 

maximally entangled state |0) = 1/ VdJ2t=i 1^)1^))- 
Applying this isomorphism, we deduce from Eq. Q that 
$o = *i- This contradicts the assumption Co H Ci = 0. 
□ 

So whenever two compact convex sets of quantum 
channels are disjoint, wc can operationally distinguish 
between them with a success probability strictly larger 
than ^, and any bipartite pure state with full Schmidt 
rank can be used as input. 

The really interesting thing here is that the equality 
of 5(G ,Ci) = D(C ,Ci) does hold. The key to this 
is the application of Sion's minimax theorem and the 
following semi-definite programming characterization of 
the diamond norm recently discovered by Watrous [2(| ■ 

Lemma 1. (Watrous \la]) For any super- operator $ = 
$0 — *i such that *o an d *i are quantum channels in 
T(H,1C), we have the following 

||*||o =max2Trp 4 X, X < I® p,Trp= l,p>0,X > 0, 

where p$ = ($ ® I-H')(\ a )( a \) is the Choi operator of 
$, | a) = Ylk=l \fy ® 1^) = Vd\il) is the unnormalized 
maximally entangled state over H. ® W , and %' is an 
isomorphic copy ofH. 



Now we can summarize the relation between D(Gq, Ci) 
and -D(Co, Ci) as follows: 

Theorem 1. Let Co and Ci be two compact convex sets 
of quantum channels in T("H,/C). Then 

5(Go,Ci) = D(C 0) Gi). 

Proof: Let us first denote 

C = Co - Ci = {$ - *i : $o € Co, $i £ Ci}. 

Then C is a compact convex set, and completely deter- 
mines D(Cq, Ci) and D(Co, Ci). We also write 

R = {(X, p):0<X<I®p,p>0,Trp=l}. 

Clearly, R is also a compact convex set. 
By Lemma [TJ we can rewrite 

-DfCo, Ci) = min max 2Trp$X. 

Noticing that both C and R are compact convex sets, 
and the objective function 2Tr(p$A) is linear both in $ 
and (X, p), by Sion's minimax theorem we can exchange 
the order of "max" and "min" as follows: 

-D(Co,Ci)= max min2Trp$X. 

(x, P )efl*ec 

Now we proceed to prove D(Cq, Ci) = D(Gq, Ci). We 
only need to show I)(Co, Ci) > D(Cq, Ci) as the opposite 
direction is obvious according the definitions. By the 
above equation and Eq. ©, it suffices to show that for 
any (X, p) £ R there is a density operator a G L(H®H') 
such that 

||(* ® /)(o-)l|i > Trp$X,V* G C. 

Indeed, we can choose a — \u)(u\ to be the following 
bipartite pure state 

\u) = (I <g) A) | a) and A^A = p, 

where \a) is again the unnormalized maximally entangled 
state over H®%'. 

Note that we have the following well-known fact about 
the trace norm: 

||y||i = max 2TrPy, 

0<P<I 

where Y is any traceless (Try = 0) Hermitian operator. 
Applying the above fact to (* €5 1)(<r), we have 

1 1 (*®Z)(cr) ||i = max2TrP(/(g)yl)p$(/(g)A t ) = max2Trp 4 ,<5, 
P Q 

where < P < I n ®u> and Q = (I ® A^)P[I ® A). 
Noticing that < X < I®p = I® A* A, we can easily find 
< P' < Iu®w such that X = Q 1 = (I <g> A^)P'(I ® A) 
[2l| . Thus we have 

max2Trp$Q > 2Trp$Q' = 2Trp$X, 
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which completes the proof. □ 
Remarks: After we finished the above proof, we were 
informed by Gutoski that in a recent work he generalized 
the results in Ref. [l8[ to the discrimination of two com- 
pact convex sets of quantum strategies, and obtained the 
results for the case of quantum channels as an immedi- 
ate corollary (23|. It is interesting to note that his main 
proof technique is a separation theorem of compact con- 
vex sets from convex analysis, quite similar to that in 
Ref. 18]. Instead, here we employ a different method 
by using Sion's minimax theorem and semi-definite pro- 
gramming characterization of diamond norm, in a similar 
spirit of Ref. . Hopefully, our proof may provide some 
new insight into this problem. Gutoski's paper, however, 
contains many other interesting results about the trace 
norms. 

It is also worth noting that with minor changes the 
same technique in the above proof can be used to derive 
Lemma [TJ as first shown by Watrous in Ref. [15j . 

All the above discussions are applicable to the case 
of C = {$} and Ci = Conv(U(H)). An interesting 
fact is that without auxiliary systems, we cannot opera- 
tionally distinguish between a unital quantum channel $ 
and Conv(U(7i)) even when the former is not contained 
in the latter. To see this, let p G L(H) be any density 
operator. Since $ is a unital quantum channel, it is also 
a doubly stochastic map. Thus we have the majorization 
relation $(/?) -< p (22[. By another Theorem of Uhlmann 
241 ], we know there exist a probability distribution {p k } 
and a set of unitary operators {Uk} such that 

= ^p k U h pUl 

k 

So D($,Gonv(U(H));p) = for any density oper- 
ator p from L('H). On the other hand, we have 
D($>, Conv(U(%))) > even when the input can only 
be chosen from h(7i). This indicates that D and D are 
quite different when we do not use auxiliary systems. 



IV. A LOWER BOUND FOR THE DISTANCE 
BETWEEN A QUANTUM CHANNEL AND THE 
CONVEX HULL OF UNITARY CHANNELS 



Proof: Let * = Y^k=\VkUk with {pk} a finite prob- 
ability distribution and Uk G U(Wd)- We need to show 
that 

£>(*, #) = £>($, 5> fc Z4)>C*. 

k 

Note that 

£>($, ^pkUk) =D(§ <g> I HZd) ,^p k U k <g> I HZd) ) 

k k 

>d($ ® i UZd) {n),Y,PkU k ® /L(* d )(n)), 

where |fi) = 1/Vd^2k=i 1^)1^) ^ s a maximally entangled 
state on *}id®Zd- Now applying the inequality D(p, a) > 
2(1 — F(p 1 a)), we have 

£>(#, ¥) >2(1 - F(0> ® I HZd) (Q), Y^VkUk ® 4(*„)(fi))) 

ft 

-2(1 -max \{1>\4>)\), 

where \ip) = J2k 1^)1^0 ran g es over an purifications 
of $(g> / L (2: d )(0), \4>) is a fixed purification of ^ k PkU k ® 
Ih(z d )(ty given by 

\4>) = Y,y/P*{u k ®i Zd )\n)®\kK), 
k 

{\kic)} is a fixed orthonormal basis for an auxiliary sys- 
tem JC, {qk} is a probability distribution, and \tp k ) are 
unit vectors in "Hd ® 2d- An important observation here 
is that \ipk) is in the support of $ ® ^L(2 d )(^) which is 
spanned by a set of vectors of the form (Ej <E> Iz d )\£i), 
where we assume that $ = -E\. Hence 

for some complex numbers Xj, from which we readily 
deduce that 



where 



IV; fe ) = (L k ®i Zd )\n), 

Lk = ^2 XjEj G K($) 



It is generally difficult to decide whether a given unital 
quantum channel $ is a mixture of unitary channels or 
not. One simple sufficient condition is that the Kraus 
operator space K(^>) does not contain any unitary op- 
erator, i.e., K($) n \J(H) = 0. (Note that the Kraus 
operator space K($) = span{Ek} for a quantum channel 
$ = J^k Ek ■ E\). If this is the case, we can actually ob- 
tain an analytical lower bound for the distance between 
$ and Conv(UCH)). 

Lemma 2. For any quantum channel $ G T("Hci) such 
that K($) n U{H d ) = 0, we have 

Tr(|L|-/ d ) 2 



£>($, Conv(UCH d ))) > min 



> 0. 



Since (V'fc) are unit vectors, Tr|Lfe| 2 = Tr(L\.Lk) = d. 
Thus we have 

£>(*, *) >2(1 - max | V v ^Tr(4?7 fe )/d|}) 

L k ,q k i — J 
k 

>2(1 - max{Tr(L[i7 fe )/d : L fc G #($)}) 

>2(1 - max{-Tr(£tt7) : Tr|L| 2 = d, 17 G U(^ rf )}) 
a 

=2(1 - i max{Tr|L| : Tr|L| 2 =d,Le K(®)}) 
a 

= min{iTr(|L| - l Hd f : Tr|L| 2 = d, L G 
>inf{iTr(|L|-/« d ) 2 : L G X($)}. 
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In the last step we have to use "inf" instead of "min" 
as the domain of L has been broadened from a compact 
set {L G K($) : Tr\L\ 2 = d} to an unbounded set K(3>). 
To finish the proof, we need to show that "inf in the 
last line can be replaced by "min" . First, notice that the 
RHS of the above equation is less than 2, and 

Tr{\L\-I Hd ? > (Tr|L|-d) 2 



d 2 



If Tr|L| > (x/2 + l)d then the right hand side (RHS) of 
the above equation is greater than 2. Thus 

inf min ^ Z ^ _ 



Leif(*) 



Tr|L|<(l+V2)d 



As a final remark, we need show that C$ > under 
the assumption K ($) n U(W d ) = 0. Otherwise, C$ = 
implies that there is some L G A(<I>) such that |i| = 7^^- 
In other words, L is unitary, which is a contradiction. □ 

Theorem 2. Lef $ G T(Hci) &e a quantum channel, and 
let \& G S(H m ) be any Schur channel. Then 

® Conv(U(W ro ® Hrf))) > C7«. 

Proof: The key observation here is that under the 
assumption ^ is with diagonal Kraus operators. Thus 
any L G K{$1 ® <&) can be decomposed as 

L = ®f =1 L k , L k eK{§). 

Suppose now that L = ®™ =1 Lk achieves the minimum in 
C^0$. We have 



C*®* =— jTr(|i| — /H m gjiJ 2 



1 

md 

-_ J_ Tr(eJT=i(|£ fc |-^J) 2 

ma 

=^TV(©r =1 (|Z fc |-7 w J 2 ) 



>C<t 



where we have employed the fact that 

Tr(\L k \~I Hd ) 2 >dC^,y l<k<m. 

Now the desired result follows from Lemma [2] directly. □ 
As a direct corollary, we have the following 

Corollary 1. For any Schur channel $ G S(Hd), if 
K{$) n \J(H d ) = 0, then 

Conv(U(Hf n ))) > (7$ > 0, Vn > 1. 



V. BOUNDS ON THE DISTANCE BETWEEN A 
SCHUR CHANNEL AND THE CONVEX HULL 
OF UNITARY CHANNELS 

The condition that the Kraus operator space K ($) of 
$ does not contain any unitary operator is a very strong 
constraint. In most cases we may have that $ is not 
a mixture of unitary channels but AT(<I>) contains some 
unitary operator. Here we deal with this more general 
case but only for Schur channels. In this case we are able 
to show that up to a factor of 1/2, any Schur channel 
can be approximated by a mixture of diagonal unitary 
channels. 

Let us denote 

A(H d ) = S(H d ) n Conv(U(H d )). 

Intuitively, Affld) (or simply Ad) is the set of Schur chan- 
nels that are also mixtures of diagonal unitary chan- 
nels. So any f 6 can be written into the form 
^ = J2kPkU k ■ U\, where U k are d x d diagonal unitary 
matrices. 

Theorem 3. For given Schur channel $ G Sd, we have 
\d($>, Ad) < £>(*, Conv(tU d )) < £>($, A d ). (8) 

Proof: The second inequality follows directly from 
A d C Conv(Vd)- We will employ some standard argu- 
ments in operator algebras to prove the first inequality. 
Let * = Y^kVkUk G Conv(U d ) such that 

£>($, *) = £>($, Conv(Ud)) = 5. 

We only need to prove that 

£>($,A d ) <2S. 

For any two diagonal unitary matrices U and V, let us 
introduce a map J uy : T(H d ) -> T(H d ) as follows: 

J u ' v (<f>) = U^(U-V)Vl 

It is obvious that J u,v is an isometry over T{T-Ld) in the 
following sense: 

DGJ^i), 3 U ' V (^)) = £>(*i, $2) (9) 

for any $ 1; $ 2 G T(H d ). 

Now we can further introduce a map J : T(%d) — > 
T(^ d ) such that 

J($) = f f J u < v ($)dUdV, 
Jv d Jv d 

where both dll and dV are Haar measures over the diag- 
onal unitary group U^. The map J satisfies the following 
properties: 

i). J is a contraction in the sense 

D(J($i),J($ 2 )) < D(*i,$ 2 ),V$i,$2 G T(ft d ), 
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which is a simple consequence of the convexity of the 
diamond norm and Eq. @. 

ii). J($) = $ for any Schur multiplier $ e T(Hd)- 
This is true simply due to the following observation 

jv> v ($) = $, 

where U, V G Ud are diagonal unitary matrices. 

hi). J($) is a Schur multiplier for any $ G T(%d)- In 
particular, J($) is CP whenever $ is CP. To see that, by 
a direct calculation, we find that for $ = £ fe £ fc • F fc f , 

J($) = £ 4 • F'J, E' k = di ag (E k ), F' k = diag(F fc ). 

Clearly, J($) is a Schur multiplier. When $ is CP, we 
can choose 

E' k =F' k = diag(i? fc ) = diag(F fc ), 

thus J($) is CP. 

Now we can compute that 

= aw = = 5> fc A fc • 4, 

k k 

where A k = diag(J7fe). We have 

D($,A d ) <D($,V') + D{*', A d ). (10) 

The first term in the RHS of Eq. ([TO]) satisfies 

£>($, *') = D(J($), J(*)) < £>($, *) = <5, 

where we have employed the contraction property of J, 
item i) above. 

It remains to show that the second term in the RHS of 
Eq. ((10} fulfills 

D(&,A d )<6. 
Our strategy is to choose ^" 6 T(H d ) such that 

£>(*',*") < S, tf" G A d (11) 

Then 

A d ) < £>(*',*") < 5. 

The rest of the proof devotes to finding such "J". Notice 
that each Kraus operator A k of is a diagonal contrac- 
tion. Applying a well-known fact in linear algebra, we 
can choose two diagonal unitary matrices Wu such 
that 

1 



A k = ~(V k + W k ). 



Now define 



*" = £f(vv^ + wvwt) 



Clearly g A rf . We will show that \&" satisfies Eq. 
([TT|). First, we find that - W is a CP map. By a 
direct calculation, we have 



Thus 



is also a CP map. 

Now employing essentially the same techniques first 
introduced by Haagerup and Musat in (l3| . we have 



I*" - *'llo = 



< 



< 



||*" T -*' T ||cb 

||(*"t-*'t)(/ w J|| 

H^-^IwJIloo 

||$ - *'||i 

||$-*'||o. 



That means 



£>(*',*") < £>($,*') < 5. 



□ 

It seems quite likely that in Eq. ([8]) the first inequality 
should be strict and the second one should be an equality. 
However, this is still an unsettled issue. 

Theorem 4. For given Schur channel $ G Sd and arbi- 
trary W G S m , we have 



® $, A mrM ) > £>($, A d ), V* G S J; 
i/ere m® d is a shorthand for 1-Lm ® %d ■ 



(12) 



Proof: To show Eq. (1121) . we first choose $' = 
J2 k PkUk ■ Ul £ A m(Sd such that 

D(¥<8$ ) A ro(8 «0 = D(*<8 $,¥')■ ( 13 ) 

Notice that any diagonal unitary matrix £/& can be writ- 
ten into the following form: 



i=i 



where I/. G Ud are all diagonal unitary matrices. That 
implies 



(1|*|1) eA* 
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where is the unitary channel corresponding to Lf[ . 
Intuitively, the compressed version ^>" of $' remains a 
mixture of diagonal unitary channels. Now we have 

D(#®$,#')> SU P ||(#®$-*')(|l}<l|®^)lli 
ll*lli<i 

= sup |||1><1|® ($-*")(X)||i 
ll*lli<i 

= ||$ - *"||o 

A d ), 

where we have used the fact that \E'(|1)(1|) = |1)(1| and 

*" = <i|¥'|i)eA d . 

Combining the above equation with Eq. (|13[) , we have 
proven Eq. ([T2"]). □ 

A somewhat interesting fact is that the above two re- 
sults together can be used to derive some results first 
obtained by Haagerup and Musat in Ref. [13( , which are 
applicable to the AQBC. 

Theorem 5. (Haagerup and Musat 113 1) For given 
Schur channel $ £ S^ and arbitrary $ € S m , we have 

D{V ® $, Conv(IlW)) > iD($, Conv(U d )). (14) 



Proof: Actually Eq. ([14]) is a quite straightforward 
application of the above two theorems. First notice that 
$ ® $ € S m ®cj. Applying Theorem|3]to $ ® $, we have 

£>($ ® $, Conv(U m8d )) > -£>(¥ <g> $, A m8d ). 

On the other hand, it is obvious that 

#($,A d ) > C($,Conv(U d )). 

Thus the left thing is to show 

D(* ® $, A m8d ) > D(*,A d ), 

and this is exactly the content of Theorem |4j □ 

Corollary 2. (Haagerup and Musat Jldtl ) Let $ e 
&e a Schur channel that does not satisfy the quantum 
Birkhoff property, that is, $ Conv(Ud). Then $ does 
not satisfy the asymptotic quantum Birkhoff property, 
and 

D($ 8 ^Conv(U d .»)) > -£>($, Conv(U d )). 



VI. SOME EXPLICIT COUNTEREXAMPLES 
TO THE ASYMPTOTIC QUANTUM BIRKHOFF 
CONJECTURE 

Our results in Section IV enable us to construct coun- 
terexamples to AQBC easily. Our basic strategy is to 
construct Schur channel $ satisfying K($>) n U(H) = 0- 
Then the statement that $ is a counterexample to AQBC 
follows directly from Corollary [TJ 



Example 1. Our first example is chosen from Ref. Q 
(Section 4.3). $ = E x ■ E\ + E 2 ■ El, where 



£i=Diag(l > > -^ J -^) J E 2 



Diag(0,l,-L,--j=) 



Clearly none of E\ and E 2 is unitary. We now show that 
there is no unitary in K(<&). By contradiction, assume 
that for some complex numbers A and \i we have that 
\E\ + fiE 2 is unitary. Then 

(XE t + /i^ 2 ) t (AS 1 + nEh) = h, 

from which we obtain 

|A| 2 = 1, H 2 = 1, ^(|A| 2 + M 2 ) = 1, \{\X\ 2 - ImI 2 ) = 1. 

Clearly, there is no A and /i satisfying all the above 
equations. Thus we have if($) n U(% 4 ) = 0- 
It follows from Corollary [T] that $ is a counterex- 
ample to AQBC. One can readily verify that the 
set {E\Ei,E[E 2 ,E\Ei,E\E 2 } is linearly independent. 
Thus it follows from Corollary 2.3 of Ref. [ll| that $ is 
a non-factorizable map. □ 

Example 2. Our second example is taken from Ref. [Til ] 
(Example 3.3). $ = Y?k=i E k ■ e\, where 



E x = Diag(l, -L J 8 ), E 2 = Diag(0, \1 ~Z 5 ), E 3 = E. 



V5 



J 2- 



where Z b = Diag(l, , , ) satisfying Z\ = I 5 . 

In the following we directly write / and Z for J5 and Z§, 
respectively. 

As shown in [Tlj . one can choose a set of Hcrmitian 
Kraus operators Fi, F2, F3 such that 



1 



Ei — Ei, F 2 — —\E 2 + E3), F- L 



{E 2 - E 3 ). 



It is eas y to see that $ = X)fe=i ^l- By Corollary 2.5 
of Ref. [ll| , $ is a factorizable map. 

Now we show that K($) n U("H 6 ) = 0. Again by con- 
tradiction, assume there are complex numbers Ai, A2, A3 
such that X1E1 + X 2 E 2 + X3E3 is a unitary. In other 

words, Diag(Ai, \J\XiI + ^j\X 2 Z + ^J^X^Z^ 1 ) is a uni- 
tary. This is equivalent to 

I Ai | 2 = 1, \X 1 I + V2X 2 Z + V2X 3 Z- 1 \=V5I. 

For simplicity, we may assume Ai = 1, a = V2X 2 , and 
b = V2A3. Then we can rewrite the above equation as 
follows: 

(\a\ 2 +\b\ 2 -4)I+(a+b*)Z+(a* +b)Z- 1 +ab* Z 2 +ab* Z~ 2 = 0. 

Employing the fact that {I, Z, Z~ x , Z 2 . Z~ 2 } are linearly 
independent, we have 

|a| 2 + |6| 2 =4,a + fe* =0,ab* =0. 
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Clearly, there are no a and b satisfying all the above 
equations. 

Hence Corollary Q] is applicable. This gives us a fac- 
torizable map which is also a counterexample to AQBC. 
This fact has been pointed out in the published version 
of Ref . [ll| , and was derived by the result in Ref . [l3[ . 
□ 

As a matter of fact, all counterexamples to AQBC pre- 
sented above are simply the counterexamples to QBC. It 
would be quite interesting to know for what kind of unital 
channels $ these two properties arc different, i.e., $ is a 
counterexample to QBC, but fulfills AQBC. A systematic 
way to construct unital channels that violate QBC has 
been proposed by Bravyi and Smolin using the idea of 
unextendible maximally entangled bases [25| . All unital 
channels $ constructed in this way will automatically sat- 
isfy the condition K($>)Ci\J(H) — 0. However, it remains 
a formidable task to verify whether these unital channels 
fulfill or violate the AQBC. A preliminary step towards 
this goal is to invent some tractable upper bounds for the 
distance between a unital channel and the convex hull of 
unitary channels. 



VII. A CONNECTION TO GROTHENDIECK'S 
INEQUALITY 

It is well known that Grothcndicck's inequality (GI) 
in the metric theory of tensor products is closely related 
to Bell's inequality in quantum information theory (26[. 
As an interesting application of the results in Sections IV 
and V, we explain here that GI has intimate links with 
AQBC too. Let us first recall the equivalent formulation 
of GI in terms of Schur multipliers. Let 

§ d = {$ s :5eL(H d ),||$s||i <!}■ 

Namely, Sd is the unit ball of the space of all Schur 
multipliers on \j{%d) with respect to the trace norm 
or any of the three other norms considered before (see 
Proposition It is well known that $s € §y if 

and only if there exists a Hilbert space JC and vectors 
\£d), \vi)) •••) IVd) m the unit ball of JC such that 

Skj = (61%). (15) 

Here we have assumed that S = [sjy] with respect to 
a fixed orthonormal basis of T~L d . Note that JC can be 
chosen to be finite dimensional. This representation of 
$5 is to be compared with i) of Proposition [5] Indeed, 
developing and rjj in an orthonormal basis of JC, we 
recover the representation of $s given by Proposition [5] 
The case where dim/C — 1 is of particular interest. 
The corresponding set of Schur multipliers is denoted by 
Dd, that is, Dd is the set of all Schur multipliers <§>s of 
the form 

s kj = a%l3 h l<k,j<d, (16) 



where a k , Pj are complex numbers such that 

max|a/c| < 1, max|/3j| < 1. 

k j 

It is clear that 

Conv(© d ) C S d . 

GI asserts that the converse inclusion also holds true up 
to a universal constant: 

Grothendieck's inequality. There exists a universal 
constant K such that for all d > 1 

§d C KConv{B d ). (17) 

The smallest constant K is called Grothendieck's con- 
stant, denoted by Kq. The exact value of Kq is still 
unknown. But it is well known that 1 < Kq < 1.4049. 
(Note here all scalars are assumed to be complex num- 
bers). What is important for us is the fact that Kq > 1. 
We refer to Chapter 5 of Ref. [27j and page 19 of Ref. 
[28j for more information. 

It is easy to see that a Schur multiplier $5 is positive 
if and only if we can choose and r\ k in Q15p such that 
£,k = Vk for all 1 < k < d. Let S~^ denote the positive 
part of Sd- Accordingly, let denote the positive part 
of H>d- Namely, ID J is the set of all Schur multipliers of 
the form ([i"6]) with a k — fik- We again have obviously 

Conv(B+) C S+. 

Surprisingly, this time the converse inclusion does not 
hold up to a universal constant. More precisely, let Kf 
denote the least constant such that 

S+ C X+Conv(D+). 

Then we have the following result of Kashin and Szarek 
from Ref. [2j| (see also the lemma on the page 17 of Ref. 

Hi). 

Proposition 4. There exist two positive constants a and 
(3 such that a log d < < (3 log d for all d > 1 . 

The Schur multipliers we are interested in arc Schur 
channels (unital positive Schur multipliers). Recall that 
the set of all Schur channels on L(Wd) have been denoted 
by Sd in the previous sections. This set could be also 
denoted by S^j in the current notational system (1 being 

for "unital"). Accordingly, A d is the subset of Conv(O^) 
consisting of Schur channels. As shown in sections IV 
and V, to disprove AQBC is equivalent to showing that 
the obvious inclusion 

A d C S d (18) 

is strict for some d. We now show that this inclusion is 
strict for large d in the spirit of Proposition H) To this 
end let denote the least constant K such that 

S d C ifConv(D+) . 

Then inclusion (fTS)) is strict if and only if ifi\ > 1. 
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Proposition 5. Kt x — Kg for all d. 

Proof: It is clear that Kj s < Kj. To prove the 

converse inequality, let $5 € . Then there exist a 

Hilbert space K. and vectors in the unit ball of 
K such that 

Without loss of generality, we can assume that ^ 7^ for 
all k. Let 

^ = tIht and = fete)- 

lis* II 

Then $t & Sd i4Tj" 1 Conv(Dj), so there exist complex 
numbers afc,j of modulus not greater than 1 and positive 
numbers Xi such that 

*fcj = K d.l X i a ti a 3,i and XI Xi = 1 - 

It follows that 

Skj = K+^XiQMal^U^) e K+ 1 Conv(B+). 

We then deduce < □ 
Consequently, Kl[, w logd as <i — > 00. This implies 

that inclusion (|18l) is strict for large d. Therefore, for any 

sufficiently large d, there exists a Schur channel $ over 

L(7^d) that is not a mixture of unitary Schur channels. 

As direct consequences of the results in section V, such 

a Schur channel must violate AQBC. 

By Ref. [7J the first integer d for which Kf-, > 1 is 

d = 4. Thus K+ = K+ tl = 1 if and only if d < 3. This 

means that inclusion (|18p is an equality for d < 3 and 

becomes strict for c£ > 4. 

On the other hand, denote Kd the least constant K 

in (IT71) for a hxed d. Note that i-Tc = sup d iCj. It was 



proved independently by Davie and the third named au- 
thor that Ki = 1 (see the remark at the end of Chapter 
5 of Ref. [13] )■ It seems, however, that the first integer 
d for which Kd > 1 is still unknown. This problem is re- 
lated to the characterization of the extreme points of §<z- 
Indeed, Kd > 1 is equivalent to the existence of extreme 
points $5 of E>d that are not of the form S = [ce^Pj] for 
some complex numbers ak and j3k with |afe| = |/3fc| = 1 
for all k. Thus such extreme points exist for large d. It 
would be interesting to characterize the extreme points 
of §d in the spirit of Refs. @ and 0- 
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